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ABSTRACT 

We present predictions for the abundance of sub-mm galaxies (SMGs) and Lyman- 
break galaxies (LBGs) in the ACDM cosmology. A key feature of our model is the 
self-consistent calculation of the absorption and emission of radiation by dust. The 
new model successfully matches the LBG luminosity function, as well as reproducing 
the properties of the local galaxy population in the optical and IR. The model can 
also explain the observed galaxy number counts at 850/im, but only if we assume a 
top-heavy IMF for the stars formed in bursts. The predicted redshift distribution of 
SMGs depends relatively little on their flux over the range 1-10 mJy, with a median 
value of z « 2.0 at a flux of 5mJy, in good agreement with the recent measurement 
by Chapman et al. The counts of SMGs are predicted to be dominated by ongoing 
starbursts. However, in the model these bursts are responsible for making only a few 
per cent of the stellar mass locked up in massive ellipticals at the present day. 
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1 INTRODUCTION 

The detection of populations of star-forming galaxies at high 
redshift (2 > 2) in the late 1990s opened a window on the 
process of galaxy formation when the Universe was less than 
20% of its current age. Two main techniques have been used 
to discover such objects: (1) Lyman-break galaxies (LBGs) 
are detected in optical bands from their stellar emission in 
the rest-frame ultraviolet, using the spectral break around 
912A produced by absorption by intervening neutral hydro- 
gen (Steidel et al. 1996) . The largest samples of LBGs have 
been obtained for z ~ 3 — 4 (Steidel et al. 1999). The dust 
extinction corrections required to derive star formation rates 
of LBGs from their rest-frame UV luminosities are probably 
quite large (a factor of ~ 3 — 10) (e.g. Meurer et al. 1999; 
Adelberger & Steidel 2000), but remain uncertain, this be- 
ing a major obstacle preventing an accurate determination 
of the contribution of LBGs to the global star formation 
rate. (2) Sub-mm galaxies (SMGs) are detected by emission 
from warm dust in the rest-frame far-IR/sub-mm (Small et 
al. 1997; Hughes et al. 1998). From their discovery using the 
SCUBA instrument on JCMT, the faint mjy-flux SMGs 
were suspected to lie at high redshifts (e.g. Yun & Carilli 
2002) and this has now been firmly established by mea- 
surements of spectroscopic redshifts, which give a median 
redshift z ~ 2.2 — 2.4 for sources with 850/im fluxes around 



5mJy (Chapman et al. 2003, 2004). The SMGs are generally 
interpreted as dust-enshrouded starbursts, in which the dust 
is being heated by UV radiation from young stars, but esti- 
mates of their star formation rates, based on the measured 
850/im fluxes, are uncertain because of a lack of knowledge 
of the shapes of their spectral energy distributions (SEDs) 
at the shorter wavelengths which presumably dominate the 
bolometric luminosity of the dust. There also remains the 
possibility that a significant fraction of the sub-mm emis- 
sion may be powered by heating of dust by AGNs rather 
than young stars. Recent X-ray observations suggest that 
the AGN contribution to the sub-mm emission is small in 
most SMGs (Alexander et al. 2003), but this issue will only 
be finally settled by further observational data. 

A third technique that has been used to search for high- 
redshift star-forming galaxies is to look for the Lya emission 
line (e.g. Hu, Cowie & McMahon 1998). The usefulness of 
Lya emission as a quantitative star formation indicator is, 
however, limited by the fact that many star-forming galaxies 
do not show Lya in emission (e.g. Steidel et al. 2000), and 
that resonant scattering of Lya photons by neutral hydrogen 
makes corrections for dust extinction even more uncertain 
than for the UV continuum. Thus, more reliable estimates 
of the star formation rates in these systems typically use 
measurements of the rest-frame UV luminosity, as in LBGs. 
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We present predictions from our semi-analytical model for 
Lya emitting galaxies in a separate paper (Le Delliou et al. 
2004), but do not consider them further here. 

Within the uncertainties, galaxies detected as LBGs 
and as SMGs appear to contribute comparably to the global 
star formation rate at high redshifts (e.g. Small et al. 2002). 
In combination, they should, in principle, provide a census 
of all high-mass star formation at redshifts z ^ 2, since one 
should either see the UV radiation from these stars directly 
(as in LBGs) or after reprocessing by dust (as in SMGs). It 
is therefore of fundamental importance to understand both 
types of galaxy, in order to understand the process of galaxy 
formation as a whole. In this paper, we describe the main 
results of our attempt to do this within the framework of 
the CDM model of structure formation; subsequent papers 
will describe different aspects of our model in more detail. 

We build on our previous work using semi-analytical 
models of galaxy formation to calculate the evolution of the 
galaxy population ab initio. These models predict the mass 
accretion, merging and star formation history of each galaxy, 
along with the evolution of its gas mass and metallicity, and 
the mass and size of the disk and bulge. In Baugh et al. 
(1998) we interpreted the first observational results on LBGs 
in terms of a hierarchical model of galaxy formation, but 
without including any treatment of dust. In Granato et al. 
(2000, hereafter GOO), we presented a new, multi-wavelength 
version of our model which included a comprehensive treat- 
ment of the reprocessing of stellar radiation by dust (using 
the GRASIL code developed by Silva et al. 1998, hereafter 
S98), and showed that it reproduced the luminosity func- 
tions and other properties of galaxies ai z — all the way 
from the far-UV to the far-IR. However, as we show in this 
paper, when the GOO model in its original form is applied 
to the high-z universe, it dramatically underpredicts both 
the number density of bright LBGs and the number counts 
of SMGs at 850^m. Of these, the latter problem seems the 
most serious, since the observed sub-mm fluxes of SMGs are 
usually interpreted as requiring objects with star formation 
rates in excess of lOOMoyr"^ (for a standard initial mass 
function) to be commonplace; such vigorous star formation 
rates at high redshifts are difficult to accommodate in the 
CDM model. To see if it is possible to achieve such high sub- 
mm luminosities in high-z galaxies, we have therefore made 
significant modifications to our original model, in regard to 
both star formation and the behaviour in bursts, in order 
to arrive at a new self-consistent model, which at the same 
time reproduces the properties of the LBGs and SMGs at 
high redshift and the main properties of the galaxy popula- 
tion at low redshift. We present here the first results from 
this new model. 

The challenge of modelling galaxies in the high redshift 
universe has received much attention in the past decade. 
Simple theoretical and empirical models have been pre- 
sented which use the number counts of galaxies observed 
at different wavelengths to try to constrain the global star 
formation history of the Universe (e.g. Pearson & Rowan- 
Robinson 1996; Blain et al. 1999a, 1999b; Chary & Elbaz 
2001; Balland, Devriendt & Silk 2003). More sophisticated 
models that attempt to follow the physics of galaxy for- 
mation in greater detail have stressed the importance of 
bursts of star formation at high redshift in explaining the 
abundance of SMGs (Guiderdoni et al. 1998; Devriendt & 



Guiderdoni 2000) and LBGs (Somerville, Primack & Faber 
2001). Most recently, Granato et al. (2004) have developed 
a coupled model for the evolution of galactic spheroids and 
QSOs, which successfully reproduces the counts of SMGs 
and the evolution of the K-band luminosity function with 
redshift (e.g. Pozzetti et al. 2003; Drory et al. 2004). 

Our methodology represents a number of significant ad- 
vances over earlier attempts to model the SMGs and LBGs 
in the framework of hierarchical galaxy formation. Previous 
work has tried to model either SMGs (e.g. Guiderdoni et al. 
1998; Blain et al. 1999b) or LBGs (e.g. Baugh et al. 1998; 
Somerville, Primack & Faber 2001), but not both popula- 
tions together. This allowed them to tune their model pa- 
rameters to fit either one population or the other. In this pa- 
per, we try to explain both populations together. Our model 
has the following important features, which improve on ear- 
lier work: (i) It directly predicts the merger histories of dark 
matter haloes and of the galaxies contained within them. 
This allows us to predict the redshifts and gas masses of 
bursts of star formation triggered by galaxy mergers, (ii) The 
absorption of starlight by dust is calculated self-consistently 
by radiative transfer, using the dust mass obtained from the 
gas mass and metallicity, and the disk and bulge scalelengths 
predicted by the model. This is particularly important when 
computing the amount of extinction at short wavelengths. In 
turn, this determines the amount of energy absorbed by the 
dust and re-radiated at long wavelenths. (iii) The spectrum 
of dust emission is calculated using a distribution of dust 
temperatures within each galaxy, which is obtained by solv- 
ing for the radiative equilibrium temperatures of individual 
dust grains. 

In contrast to our detailed treatment of dust emission, 
previous works either treated the dust temperature as a free 
parameter in fitting the SMC number counts (e.g. Kaviani, 
Haehnelt & Kauffmann 2003), or used empirical IR/sub- 
mm SEDs estimated from present-day galaxies (e.g. Pear- 
son & Rowan-Robinson 1996; Blain et al. 1999a; Devriendt 
& Guiderdoni 2000) . The task of fitting the observed num- 
ber counts of SMGs is much simpler if the dust tempera- 
ture is treated as a free parameter: for a dust emissivity 
El/ oc u^ , bolometric dust luminosity Ld, and temperature 
Td, the luminosity per unit frequency at long wavelengths 
scales as L^ oc u'^'^"Ld/T^, which varies roughly as T^^ 
at fixed Ld for /3 ~ 2. This might suggest that it is easy 
to reconcile any theoretical model with the observed sub- 
mm counts simply by making modest adjustments to the 
assumed dust temperature. However, treating the dust tem- 
perature as a free parameter is physically inconsistent, since 
it is actually determined by the condition that the dust 
grains be in thermal equilibrium between radiative heating 
and cooling. For dust mass Mj, with a single dust temper- 
ature, thermal equilibrium implies Ld oc MdT^ , so that 
L, oc ,.2+/5Mf +«/(*+'3'Ly ("+'3' if Md and Ld are treated as 
the physical parameters, rather than Td and Ld- This argu- 
ment shows that once one imposes the condition that dust 
temperatures be physically consistent, the predicted sub- 
mm luminosities in a model can only be changed by large 
factors by making large changes to the dust masses. 

Additional shortcomings of previous works are that they 
either did not include halo and galaxy mergers (e.g. Guider- 
doni et al. 1998), or treated the amount of dust extinction 
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in high-z galaxies as a free parameter (e.g. Somerville et al. 
2001). 

In the next Section we explain the major changes to the 
model since GOO. We present the results from our model in 
Section 13 and our conclusions in Section |1] 



2 THE MODEL 

We now give a brief overview of our new galaxy formation 
model. In Section r2.1l for completeness, we list some general 
differences from the model presented by GOO, which were 
motivated by changes in the preferred values of the cosmo- 
logical parameters and by improvements in the modelling 
of feedback processes (Benson et al. 2003; hereafter BOS). 
We demonstrate in Section |3] that neither the GOO model 
nor the model presented by BOS can reproduce the number 
counts of sub-mm galaxies. The additional changes required 
to the model of BOS to rectify this are set out in detail in 
Section 12.21 The impact of these changes on the predicted 
sub-mm counts is demonstrated in Fig. |K|in Section |21 Full 
details of the semi-analytic galaxy formation model can be 
found in Cole et al. (2000) and BOS. The GRASIL model for 
the emission from stars and dust is described by S98, and 
its implementation in the semi-analytic model is described 
in GOO. A full description of our new model will be given in 
Lacey et al. (2004a). 



2.1 The basic model 

The galaxy formation model from which we start is the 
one described by BOS. This contains a number of changes 
from the model of GOO. We assume a flat, ACDM cosmol- 
ogy, with density parameter flo = O.S, a Hubble constant of 
h = //o/(100kms~^Mpc — 0.7 and with a perturbation am- 
plitude set by the linear rms fluctuation in spheres of radius 
8/i"^Mpc of o-g = 0.9. 

One of the major differences is motivated by a revision 
in the value of the cosmological baryon density, fit, to be 
consistent with recent constraints from Big Bang Nucleosyn- 
thesis models (O'Meara et al. 2001) and from measurements 
of fluctuations in the cosmic microwave background radia- 
tion (e.g. Spergel et al. 200S). We adopt a value of fib = 0.04, 
which is twice that used by GOO. To prevent the formation 
of too many luminous galaxies, BOS found it necessary to 
invoke a "superwind" mode of feedback, in which cold gas 
is ejected from a galaxy in proportion to the star formation 
rate. The superwind operates in addition to the feedback 
used by GOO (see Cole et al. 2000). There is observational 
support for the existence of superwinds from the detection 
of high- velocity outflows in massive galaxies at both low and 
high redshift (e.g. Martin 1999; Heckman et al. 2000; Pettini 
et al. 2002; Adelberger et al. 200S; Small et al. 200S). 

The BOS model also includes a simple treatment of the 
feedback from photoionization of the intergalactic medium, 
which suppresses the collapse of gas into low mass dark halos 
after reionization. We model this by assuming that no gas 
cools in halos with circular velocities Vc < 60kms~^ at z < 6. 
This has a similar effect on the predicted form of the galaxy 
luminosity function to the more realistic and sophisticated 
treatment developed by Benson et al. (2002b). 

The GRASIL code computes the emission from both the 



stars and dust in a galaxy, based on the star formation and 
metal enrichment histories predicted by the semi-analytical 
model. It includes radiative transfer through a two-phase 
dust medium, with a diffuse component and giant molecu- 
lar clouds, and a distribution of dust grain sizes. Stars are 
assumed to form inside the clouds and then gradually leak 
out. The output from GRASIL is the galaxy SED from the 
far-UV to the sub-mm. The motivation for the choice of dust 
parameters in the GRASIL model and the impact of varying 
these choices are discussed in detail in GOO (see also Fronton 
et al. 2004). 



2.2 Key features of the new model 

In Section |21 we will show that the predictions of the GOO 
and BOS models do not match the observed number counts 
of sub-mm sources. The positive effect on the number counts 
of increasing the baryon fraction by a factor of two between 
GOO and BOS is offset by the inclusion of superwinds in the 
latter. It is therefore necessary to make a number of im- 
portant changes to the BOS model in order to reproduce 
the SMC counts. Our freedom to change the BOS model is, 
however, restricted for two reasons. Firstly, we retain the 
cosmological parameters adopted by BOS. This sets the rate 
at which structure grows in the dark matter, which in turn 
determines the halo merger trees in which galaxies form and 
evolve. Secondly, we follow the philosophy set out by Cole 
et al. (2000), and demand that any new model should repro- 
duce basic properties of the local galaxy population, such as 
the luminosity functions in the optical, near-IR and far-IR, 
the distribution of disk scale sizes, the gas and metal con- 
tents of disks and spheroids, and the mix of morphological 
types. The task of finding a successful model is made even 
more demanding by our use of a self-consistent calculation of 
the absorption and re-emission of radiation by dust. We em- 
phasize again that the temperature distribution of the dust 
grains is not a free parameter in our model, but is deter- 
mined by the stellar luminosity and the mass, distribution 
and composition of the dust. We largely retain the same 
parameters in the GRASIL code as used by GOO, with modi- 
fications to the escape time of young stars from dust clouds 
and to the long-wavelength emissivity of the dust grains, 
which we discuss in the next sections. 

The model set out below was reached after an extensive 
exploration of parameter space, within the boundaries set 
by the stringent constraints outlined above, with the goal of 
improving the model predictions for the high redshift galaxy 
population. The consequences of systematically varying key 
model assumptions will be illustrated in Section |21 

Star formation timescale in disks: The star formation 
rate is given hy ^l> = Mgas/r*, where Mgas is the cold 
gas mass. GOO and BOS assumed that the star formation 
timescale in disks, r*, incorporated a scaling with the dy- 
namical time of the disk. In this case, the star formation 
timescale becomes shorter with increasing redshift. Conse- 
quently, mergers at high redshift tend to be between gas- 
poor disks, with little fuel available to power a starburst. 
In order to move more star formation from the quiescent to 
the burst mode at high redshifts, we adopt a prescription for 
the star formation timescale in disks without the dynamical 
time scaling: 
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r*o(Vc/200kms"')' 



(1) 



where t^q is a constant. This parametrization of the star for- 
mation timescale is similar to that employed in our earher 
models (Cole et al. 1994; Baugh et al. 1996, 1998). There 
are no compelling theoretical arguments for preferring one 
of these prescriptions over the other. (We test the star for- 
mation recipe in our models against observational data in 
Bell et al. 2003.) In both cases, appropriate parameter val- 
ues can be chosen to reproduce the observed gas fraction- 
luminosity relation a.t z = Q (see Fig. 9 of Cole et al. 2000) ; 
here, we choose rto = 8Gyr and q* = —3. In the new model, 
galactic disks at low redshift have roughly the same star 
formation timescales as in GOO, but disks at high redshifts 
have much longer star formation timescales than before. As 
a consequence, in the new model disks at high redshift are 
much more gas-rich than in GOO, resulting in more gas be- 
ing available for star formation in starbursts triggered by 
mergers between these disks. This type of picture is hinted 
at observationally by the very large gas fractions implied by 
CO measurements of SMGs (e.g. Neri et al. 2003). Similar 
schemes resulting in gas-rich mergers at high redshift have 
previously been invoked to explain the luminosity function 
of Lyman-break galaxies (Somerville et al. 2001) and the 
evolution of the quasar population (Kauffmann & Haehnelt 
2000). 

Star formation bursts triggered by galaxy mergers: 
Bursts of star formation are assumed to be triggered only 
by galaxy mergers. We define a galaxy merger as major or 
minor according to whether the ratio of masses of the two 
galaxies exceeds /eiiip or not. In major mergers, the stel- 
lar disks are assumed to be transformed into a new stel- 
lar spheroid, while in minor mergers, the stellar disk of the 
larger galaxy is preserved. Bursts are assumed to be trig- 
gered in all major mergers (as in GOO), but also (unlike 
in GOO) in minor mergers which satisfy both of the fol- 
lowing conditions: (i) the galaxy mass ratio exceeds /burst 
(where /burst < /ciup); (ii) the gas fraction in the primary 
galaxy exceeds /gas,crit. The latter condition is motivated 
by the suggestion of Hernquist & Mihos (1995) that gas- 
rich disks should be more susceptible to bursts triggered 
by the accretion of small satellites. We adopt values of 
/ciiip = 0.3, /burst = 0.05 and /gas.crit = 0.75. Another mod- 
ification is that the timescale of star formation in a burst 
is taken to be r*burst = max [/dynTdyn,T*burst,min], where 
Tdyn is the dynamical time of the newly formed spheroid 
(GOO assumed T*burst = /dynTdyn). Wc adopt /dyn = 50 and 
''"*burst,min = 0.2Gyr; these values were found to produce the 
best match to the present day GOum luminosity function and 
the abundance of SMGs. 

The IMF in bursts: Perhaps the most dramatic change 
from our previous work is the adoption of a top-heavy ini- 
tial mass function (IMF) for stars formed in bursts. This 
is the single most important change to the model, which 
has the biggest impact on the predicted counts of sub-mm 
sources. For stars formed quiescently in disks, we use a solar 
neighbourhood IMF, with the form proposed by Kennicutt 
(1983): diV/dlnm oc m~^, with x = 0.4 for m < IMq and 
X = 1.5 for m > IMq. In bursts, we instead use a single 
power law with a; = 0. In both cases, the IMF covers a 
mass range 0.15 < in < 125M0. (In GOO, we used a Ken- 
nicutt IMF for all star formation.) There are two factors 



motivating the adoption of a top-heavy IMF in starbursts. 
Firstly, the total energy radiated in the UV per unit mass 
of stars formed is increased due to the larger fraction of 
high mass stars, which increases the amount of radiation 
heating the dust. For example, the energy per unit mass 
emitted at 1500A is around 4 times larger for the top-heavy 
X = IMF compared to the Kennicutt IMF. Secondly, a 
top-heavy IMF also results in a higher yield of metals from 
Type II supernovae, so that more dust is produced to absorb 
this energy. For stars formed with the top-heavy IMF, more 
than twice the mass is recycled to the interstellar medium 
and over six times as many metals are produced, compared 
with the case of stars formed with a Kennicutt IMF. The in- 
creased production of dust for a top-heavy IMF is essential 
for boosting the luminosity of galaxies in the sub-mm range, 
as discussed in the Introduction. If the stellar UV luminosity 
were to be increased without changing the mass of dust, the 
dust would be heated to higher temperatures, shifting the 
peak of the dust emission spectrum to shorter wavelengths, 
and resulting in relatively little increase in the luminosity at 
wavelengths longwards of the peak (including the sub-mm) . 
Including the increased dust production, using a top-heavy 
IMF results in the most efficient production of luminosity 
at sub-mm wavelengths for a given rate of star formation. 



3 RESULTS 

We first make some general comparisons between the predic- 
tions of the GOO model and our new model, before focusing 
on the properties of galaxies at high redshift. 

Cosmic star formation history: We compare the global 
star formation histories in GOO and the new model in Fig. 
The overall star formation rate in the new model (Fig. ^) 
is similar to that in GOO (Fig.^). There is a broad peak in 
the total star formation density at z ~ 2 — 3, corresponding 
to a look-back time of ~ lOGyr. However, in contrast to 
GOO, where star formation in bursts is around 5 per cent 
of the total at all redshifts, in the new model the fraction 
of star formation occuring in bursts increases sharply with 
redshift, and exceeds that in quiescent disks for z>3. This 
increase in the importance of bursts at high-z relative to GOO 
results both from the change in the star formation timescale 
in quiescent disks, which causes them to be more gas-rich at 
high z, and from allowing bursts to be triggered by minor 
mergers. The fraction of star formation in bursts summed 
over the history of the universe remains modest, however. 
Only 30 per cent of star formation takes place in the burst 
mode in the new model. The contribution of bursts to the 
mass locked up in stars is even less impressive. Taking into 
account the large fraction of mass recycled by dying stars 
with the top-heavy IMF, less than 7 per cent of the mass 
locked up in stars today is predicted to have been made in 
bursts. 

Present-day galaxy luminosity function: The present- 
day galaxy luminosity function plays an essential role in 
constraining the parameters of the galaxy formation model. 
Our philosophy is to consider only those models that pro- 
duce reasonable matches to the optical galaxy luminosity 
function, tracing the stellar population, and the far-IR lu- 
minosity function, tracing the emission from dust. Fig. |5| 
compares the predictions of the new model with the ob- 
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Figure 1. The visible star formation rate (i.e. excluding brown 
dwarfs) per unit volume as a function of redshift. (a) The predic- 
tions of the new model presented in this paper, (b) The equivalent 
results for the GOO model. In both cases, the solid line shows the 
total star formation rate density, the dotted line indicates the qui- 
escent star formation in galactic disks, and the dashed line shows 
the star formation in bursts triggered by galaxy mergers. 



served luminosity functions at z = in the optical (B-band) 
and in the far-IR (60/im). In Fig. |2| (and Fig. O, the error 
bars on the model curves represent the statistical uncertain- 
ties resulting from the finite sample of galaxies computed; 
they are calculated as described in GOO. The new model 
reproduces the local optical and far-IR luminosity function 
data almost as well as the model of GOO. As in GOO, ongoing 
bursts dominate the bright end of the 60^m luminosity func- 
tion but are less important in the optical. The new model 
model also reproduces other properties of the local galaxy 
population, such as the ratio of gas mass to luminosity as a 
function of luminosity and the stellar metallicity versus lu- 
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Figure 2. The predictions for the optical (top) and far-IR (bot- 
tom) luminosity functions at 2 = for the new model. The dotted 
lines show the luminosity function of galaxies that are forming 
stars quiescently, the dashed lines show the contribution from on- 
going bursts, and the solid lines show the total luminosity func- 
tion. In the upper panel, the dot-dashed line shows the predicted 
luminosity function in the absence of dust extinction. The error 
bars on the model predictions indicate the Poisson error arising 
from the number of galaxies simulated. The open symbols with 
error bars show the observational data, taken from the sources 
given in the legend. 



minosity relation (see figures 9 and 10 from Cole et al. 2000, 
which are very similar to the results from our new model). 
Further comparisons of the new model with observed prop- 
erties of present-day galaxies will be given in Lacey et al. 
(2004a). 

Lyman-break galaxies: A very important prediction of 
the model is the galaxy luminosity function in the rest-frame 
UV at high redshift. This depends both on the distribution 
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Figure 3. The luminosity function in the rest-frame UV at 
2 = 3. The top and bottom panels respectively show the pre- 
dictions from the new model presented in this paper and that of 
GOO. In both panels, the symbols show the observational estimate 
of the rest-frame UV luminosity function of LBGs at 2 Pd 3 ob- 
tained by Steidel et al. (1999) by combining ground-based and 
HST data. The rest-frame wavelength of 0.17/im corresponds to 
the observer-frame 7?.-band used by Steidel et al. for 2 = 3. In 
each panel, the solid line shows the predicted total luminosity 
function including dust extinction, while the dotted and dashed 
lines show the contributions to this from quiescent galaxies and 
ongoing bursts respectively. The dot-dashed line shows the total 
luminosity function without dust extinction. 
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Figure 4. The cumulative number counts per square degree at 
850fj,m. The model predictions are shown by the lines: the total 
counts are shown by the solid line, and the contributions from 
bursts and quiescent galaxies by the short-dashed and dotted lines 
respectively, (a) Shows the counts predicted by the new model and 
(b) shows the predictions of the GOO model. The symbols show a 
compilation of observational data, as given in the key. 



of galactic star formation rates and on the amount of extinc- 
tion by dust. We plot the rest-frame UV luminosity function 
at 2 = 3 in Fig. |2l comparing the model predictions with 
observational data for LBGs. The top panel shows the pre- 
dictions of our new model, while the lower panel shows the 
fiducial model of GOO. The figure shows that the GOO model 
would match the observed UV luminosity function if there 
were no dust extinction, but once the effects of dust have 
been included in a consistent way, the GOO model is typi- 
cally 2 magnitudes too faint at a given galaxy abundance. 



In the new model, the dust-exticnted UV luminosity func- 
tion matches the observations well. The luminosity function 
would be brighter by 1.5-2.5 magnitudes or a factor 4-10 in 
luminosity in the absence of dust. The level of extinction in 
our model galaxies is similar to that inferred by Steidel et 
al. (1999) using a simple extinction law and the observed 
G ~ R colours of their Lyman-break sample. However, a 
detailed comparison requires reproducing the observational 
LBG selection in our model, and we defer this to a future 
paper (Lacey et al. 2004b). 
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Figure 5. The eflFect of varying key model parameters and assumptions on the predicted cumulative number counts per square degree 
at 850fj,m. The total source counts predicted by the new fiducial model are shown by the solid line reproduced in each panel. The dashed 
lines show the predicted total counts in variant models: (a) when a standard solar neighbourhood IMF is used in bursts of star formation, 
(b) when the emissivity of the dust in bursts matches that in quiescent galaxies, (c) when bursts triggered by minor mergers are switched 
off and (d) when the condition that the primary galaxy be gas rich in order to trigger a burst in a minor is switched off. The observational 
data are the same as those plotted in Fig. |4] 



Since, on average, only a small fraction of the UV light 
escapes from galaxies with large intrinsic UV luminosities, 
the dust-extincted UV luminosity function is sensitive to the 
parameters of the dust model. In the new model, we have 
adopted a value for the escape time of stars from molecu- 
lar clouds of fcsc ~ IMyr for both bursts and quiescent star 
formation in order to match the observed luminosity func- 
tion of LBGs. For this choice, ongoing bursts dominate the 



extincted UV luminosity function at high z. However, our 
current choice of dust parameters may not be the only one 
that allows a match to the observational data. The sub-mm 
emission from these galaxies is much less sensitive than the 
UV emission to the details of the dust model, such as cloud 
masses, radii and escape times, provided that most of the 
UV light is absorbed. The impact of varying key assump- 
tions of the new model on the abundance of Lyman-break 
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galaxies will be explored in a forthcoming paper, in which we 
will also explore in detail the connection between the sub- 
mm fluxes and rest-frame UV luminosities of high-z galaxies, 
and between SMGs and LBGs (Lacey et al. 2004b). 

Sub-mm galaxies: The other crucial constraint on high- 
redshift galaxies comes from the number counts of faint 
SMGs. Fig. |1] shows the cumulative number counts as a 
function of flux for galaxies selected by their emission at 
850/^m. The model predictions are compared to a compila- 
tion of data obtained by different surveys using SCUBA. The 
model of GOO, shown in Fig.0|3, is seen to underpredict the 
observed counts, by a factor ~ 20 at 3mJy. The new model 
presented in this paper is much more successful (Fig. |lji), 
agreeing spectacularly well with the observations. To obtain 
a better fit to the number counts of SMGs, we have mod- 
ified the emissivity of dust grains in bursts at wavelengths 
A > 100/im, from the canonical e^ oc v^ (which we retain 
for quiescent star formation, and which GOO used for both 
modes of star formation), to e^ oc u^'^. There is indepen- 
dent evidence that the dust emissivity in some starbursts 
may be flatter than e^ oc u^: S98 found that modelling the 
SED of Arp 220 favoured ei^ oc t" ' .As reviewed by Draine 
(2003), simple theoretical models of dust grains predict that 



the dust emissivity should vary as Ei/ 



at long enough 



wavelengths, independently of the grain size distribution. 
However, laboratory studies of some possible dust materials 
find a much shallower dependence than this even at sub-mm 
wavelengths. Assuming an emissivity like e^ oc u^'^ thus re- 
mains controversial. We illustrate the impact on our predic- 
tions of reverting to the canonical choice of e„ oc u^ later on 
in this section. 

In the new model, the counts are dominated by ongoing 
bursts at fluxes 0.1 < S'^(850/.im) < 30mJy. Bursts account 
for a factor of 10"^ times more sources at 3mJy in the new 
model than they did in GOO; the counts from sources that 
are not experiencing an ongoing burst have increased by a 
more modest factor, ~ 1.5-6, depending on the flux. 

We now investigate the impact on the counts of SMGs 
of systematically switching off, one at a time, the changes 
made in the new model. In Fig. |K| the total counts pre- 
dicted by the new model are shown by the solid line, which 
is reproduced in each panel. The total counts in the variant 
models are shown by the dashed line in each panel. Note 
that in this experiment, we have not altered any additional 
parameters beyond the ones stated; however, in all cases, 
the variant models predict optical luminosity functions at 
z = that are in reasonably good agreement with the ob- 
servational estimates. In Fig. (KJa), we illustrate the effect 
of using a standard Kennicutt IMF in both modes of star 
formation. This has the biggest impact on the model predic- 
tions, reducing the counts by a factor of ~ 60 at 3mJy. The 
resulting total counts are similar to those predicted by the 
GOO model. Changing the emissivity of the dust in bursts to 
be the same as in the quiescent mode, e^, cuv^, has a much 
less dramatic effect, reducing the counts by less than a fac- 
tor of 2 at 3mJy (Fig. [SJa). Given the observational scatter, 
this variant could be a viable alternative model, especially 
if dust-obscured active galactic nuclei made a comparable 
contribution to the 850/xm emission to that of dust-obscured 
starbursts (although Alexander et al. 2003 argue that this 
is unlikely to be the case). 

The importance of bursts of star formation triggered 
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Figure 6. The predicted redshift distributions of sources se- 
lected by their 850/im flux. Each panel corresponds to sources at 
a particular flux, as indicated, increasing from 1 mJy at top left 
to Sinjy at bottom right. The solid line shows the redshift dis- 
tribution for all galaxies, and the dashed and dotted lines show 
the contributions to this from ongoing bursts and quiescently star 
forming galaxies respectively. The redshift distributions at each 
flux are normalized to unit area for the total counts. The median 
redshift 250 and also the redshift 290 below which 90% of the 
galaxies lie are given in each panel. 



by minor mergers is illustrated in Fig. EIc), in which we 
show how the predicted counts change when minor merger 
bursts are turned off by setting /burst ~ /ciup- The counts 
at 3 mJy are down by a factor of « 7 without the contribu- 
tion of minor merger bursts. Finally, in Fig. Od) we show 
how the predicted counts are affected when we relax the 
condition for triggering bursts in minor mergers that the 
gas fraction in the primary galaxy should exceed /gas.crit, by 
setting /gas.crit = 0. This variant model slightly overpredicts 
the 850/im source counts. However, this model greatly over- 
predicts the number of bright 60/im galaxies at the present 
day. As a direct consequence of the parameterization of the 
star formation timescale (discussed in Section 12.21 above), 
galactic disks at high redshift tend to be gas-rich in the new 
model, whereas they become progressively more gas-poor as 
2; = is approached, through the depletion of gas by qui- 
escent star formation. The gas fraction threshold therefore 
has the effect of suppressing most minor merger bursts at 
low redshift, but has relatively little effect at high redshift. 
In Fig. |S| we show what the new model predicts for 
the redshift distributions of sources selected at 850pm. The 
median redshift is seen to vary little over the flux range 
1 < S'i/(850/im) < 8mJy, from 250 = 2.3 at the faint end to 
^50 = 1.7 at the bright end. At a flux of 5mJy, the median 
redshift is predicted to be 2.0, in good agreement with the re- 
cent estimate by Chapman et al. (2003, 2004) from a sample 
of SMGs with spectroscopic redshifts. Fig.^lalso shows that 
the redshift distributions for the quiescent galaxies peak at 
lower redshifts than the bursts. The quiescent galaxy peak 
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is at very low redshifts for the brighter part of the fiux range 
shown. Low-z quiescent galaxies are predicted to dominate 
the total counts at fluxes ^^(SSO/im) >30mJy. 

We will discuss the predictions of the new models for the 
properties of the SMGs and LBGs at length in Lacey et al. 
(2004b). In summary, at z = 2, SMGs with Sgsoyjm > 5mJy 
have median stellar masses of ~ 10 h~ Mq and reside in 
haloes with a median mass of ~ lQ^^h~^MQ. The SMGs are 
thus predicted to reside in the more massive haloes in place 
at z = 2, and therefore will be more strongly clustered than 
the dark matter at this epoch (Baugh et al. 2004), consistent 
with tentative observational constraints (Blain et al. 2004). 



4 DISCUSSION AND CONCLUSIONS 

We have combined two powerful theoretical techniques, 
semi-analytical modelling of galaxy formation and radiative 
transfer modelling of the reprocessing of stellar radiation by 
dust, to address the issue of how faint sub-mm galaxies fit 
into current theories of galaxy formation. Our approach to 
this problem represents a significant advance over previous 
work in this area for two reasons. Firstly, our overriding 
philosophy is to require that any acceptable model must re- 
produce a basic set of observational data for present-day 
galaxies, including the optical, near-IR and far-IR luminos- 
ity functions, gas fractions and metallicities, and the distri- 
bution of disk scale sizes. Secondly, the dust extinction and 
emission are computed self-consistently, rather than being 
put in by hand. Our adopted methodology severely restricts 
the range of available parameter space, making a successful 
reproduction of the high redshift galaxy population more 
challenging, but, at the same time resulting in a model with 
more predictive power. 

The new model presented in this paper extends the suc- 
cesses of the GOO model to high redshift. Our model matches 
the measured rest-frame UV luminosity function of Lyman- 
break galaxies at z ~ 3 (Steidel et al. 1999). At the same 
time, it reproduces the number counts of 850^m sources, 
and predicts redshifts for these sources in very good agree- 
ment with recent measurements from Chapman et al. (2003, 
2004). This is the first time that any realistic CDM-based 
galaxy formation model has successfully matched the prop- 
erties of both major populations of star-forming galaxies at 
high redshift. 

In order to achieve these successes while still remain- 
ing within the framework of the hierarchical assembly of 
galaxies, we found it necessary to enhance the importance 
of bursts at high redshift, by: (i) modifying the quiescent 
star formation timescale in galactic disks, in order to make 
mergers at high redshift more gas-rich; (ii) allowing trigger- 
ing of bursts by minor as well as major mergers; and (iii) 
adopting a top-heavy IMF in bursts. 

Our new model shares some features with those of 
Guiderdoni et al. (1998) and Devriendt & Guiderdoni 
(2000). They also proposed that the fraction of star forma- 
tion occuring in bursts increased strongly with redshift, and 
suggested that the IMF in bursts was top-heavy, in order 
to explain the cosmic far-IR background and the sub-mm 
number counts. However, in other respects, their model is 
very different from ours. In particular, their model does not 
include galaxy mergers, so the rate of starbursts as a func- 



tion of redshift is put in by hand. Also, they use empirical 
SEDs for the IR/sub-mm emission, rather than calculating 
dust temperatures from radiative equilibrium. 

The most controversial of our changes is the adoption of 
a top-heavy IMF in bursts of star formation. For simplicity, 
we have chosen to use a flat IMF in starbursts. We have 
explored a range of tilted and truncated IMFs. A flat IMF 
slope is not essential to match the number counts of SMGs; 
we found that reasonable matches to the counts could be 
obtained with slopes given by a; < 0.35. The critical point 
is that the relative proportion of high mass to low mass 
stars produced in a starburst should be greater than it is in 
quiescent star formation. 

We are by no means the first to suggest the abandon- 
ment of a universal IMF. Some theoretical calculations pre- 
dict a form for the IMF that is biased towards more massive 
stars, compared with the Kennicutt (1983) IMF, under the 
conditions that exist in starbursts (e.g. Padoan et al. 1997; 
Larson 1998). However, the bulk of the support for a varia- 
tion in the IMF is indirect, resulting from the challenge of 
matching observations of early-type galaxies and explaining 
the metal content of the hot gas in clusters (e.g. Renzini 
et al. 1993; Chiosi et al. 1998; Gibson & Matteucci 1997; 
Thomas et al. 1999). The metal content of the intracluster 
medium and stellar absorption line strengths in ellipticals 
can be more readily explained if significant star formation 
occured with an IMF which had a higher fraction of mas- 
sive stars compared to that in the solar neighbourhood. This 
boost in the relative proportion of high mass stars can be 
achieved either by tilting the IMF or by truncating it be- 
low a fixed stellar mass. Nagashima et al. (2004) have taken 
the new model described in this paper and incorporated a 
detailed treatment of chemical enrichment by Type la and 
Type II supernovae. The new model reproduces the observed 
abundances of O, Mg, Si and Fe in the intracluster medium 
of X-ray clusters. Nagashima et al. find that reverting to a 
standard IMF in bursts leads to the model underpredicting 
the abundances of these elements by a factor of 2-3. 

Granato et al. (2004) have recently presented an alter- 
native model which can explain the sub-mm number counts 
without assuming either a top-heavy IMF or a modified dust 
emissivity, based on spheroid formation by monolithic col- 
lapse at high redshift. They posit that the formation of very 
massive stellar spheroids occurs much earlier than in the 
model presented here, through the joint effects of very strong 
feedback from QSOs suppressing the cooling of gas in small 
halos at high redshift, and a rate of gas cooling in high- 
mass halos that is greatly enhanced by clumping effects. 
This model also reproduces the inferred evolution of the K- 
band luminosity function (e.g. Pozzetti et al. 2003: Drory 
et al. 2004). However, unlike the model presented here, the 
Granato et al. model does not include the effects of either 
halo or galaxy mergers, and does not currently treat the for- 
mation of galactic disks. The differences between these two 
models highlight the need to develop a better physical un- 
derstanding of the processes of gas cooling and feedback, in 
order to understand whether the assembly history of galac- 
tic spheroids was closer to the "monolithic" or "hierarchical" 
pictures. 

The relative contributions to the 850/im fluxes of SMGs 
from dust heating by AGNs and by starbursts have been 
the subject of much recent debate. The first Chandra X- 
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ray images of areas which had been mapped in the sub-mm 
with SCUBA revealed httle overlap between the X-ray and 
sub-mm sources, suggesting that SMGs were not powered 
by obscured QSOs (e.g. Almaini et al. 2003). Deeper X- 
ray images have revealed a much closer correlation between 
X-ray and sub-mm sources, with 30-50% of bright SMGs 
(•Sssoum > 5 mjy) having X-ray-detected AGN counterparts, 
and some others showing X-ray emission consistent with star 
formation (Alexander et al. 2003). However, despite the ap- 
parent commonness of AGNs in SMGs, Alexander et al. ar- 
gue, based on an analysis of SED shapes, that even in the 
sub-mm sources which host AGNs, the AGN contributes on 
average < 2% of the 850/im flux. The SMGs thus appear to 
be powered by star formation, as assumed in our model. 

In our new model, star formation in bursts is very im- 
portant at high redshift; however, bursts are still responsible 
for only 30% of the total star formation when integrated over 
all redshifts, compared to 5% in GOO. Once the recycling of 
mass from dying stars is taken into account, less than 7% of 
the mass locked up in stars today is predicted to have been 
produced in bursts. The new model and the GOO model pro- 
duce similar stellar mass fractions in disks and spheroids; in 
the new model 57% of the stellar mass is in disks and 43% 
is in spheroids. This is in very good agreement with the ob- 
servational estimate by Benson, Frenk & Sharpies (2002a). 
The most massive galaxies in the model tend to be bulge 
dominated, in agreement with observations (e.g. Kochanek 
et al. 2001, Nakamura et al. 2003). However, typically only a 
few per cent of the present day stellar mass in these galaxies 
was formed in bursts at z ~ 2, around the peak in the pre- 
dicted redshift distribution of 850/im sources. In our model, 
the overwhelming bulk of the stellar mass of ellipticals is 
built up in disks and then rearranged into spheroids during 
mergers (Baugh et al. 1996; Kauffmann 1996). 

Our model is able to address many other issues relating 
to sub-mm galaxies and to galaxy evolution generally, which 
will we pursue in subsequent papers in this series. These in- 
clude: a more detailed comparison with galaxy properties 
in the local universe (Lacey et al. 2004a); the environments 
and clustering of SMGs and LBGs (Baugh et al. 2004); a 
more detailed analysis of the properties of LBGs and SMGs 
and of the relationship between them (Lacey et al. 2004b) ; 
the descendants of SMGs (Frenk et al. 2004); and a com- 
prehensive set of predictions for number counts at different 
wavelengths in the IR and sub-mm, and for the extragalactic 
background light (Lacey et al. 2004c). 
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